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Abstract. A decade ago cosmological simulations of increasingly higher resolu- 
tion were used to demonstrate that virialized regions of Cold Dark Matter (CDM) 
halos are filled with a multitude of dense, gravitationally-bound clumps. These 
dark matter subhalos are central regions of halos that survived strong gravitational 
tidal forces and dynamical friction during the hierarchical sequence of merging 
and accretion via which the CDM halos form. Comparisons with observations 
revealed that there is a glaring discrepancy between abundance of subhalos and 
luminous satellites of the Milky Way and Andromeda as a function of their circu- 
lar velocity or bound mass within a fixed aperture. This large discrepancy, which 
became known as the "substructure" or the "missing satellites" problem, begs for 
an explanation. In this paper I review the progress made during the last several 
years both in quantifying the problem and in exploring possible scenarios in which 
it could be accommodated and explained in the context of galaxy formation in the 
framework of the CDM paradigm of structure formation. In particular, I show that 
the observed luminosity function, radial distribution, and the remarkable similar- 
ity of the inner density profiles of luminous satellites can be understood within 
hierarchical CDM framework using a simple model in which efficiency of star for- 
mation monotonically decreases with decreasing virial mass satellites had before 
their accretion without any actual sharp galaxy formation threshold. 
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1 Introduction 

In the hierarchical scenario of galaxy forma- 
tion [T], theoretically rooted in the Cold Dark 
Matter (CDM) structure formation model [i2J, 
galaxies form via cooling and condensation of 
gas in dark matter halos, which grow via an hi- 
erarchical sequence of mergers and accretion. 
The density perturbations in these models have 
amplitude that increases with decreasing scale 
down to ~ 1 comoving parsec or below [|3l, 
with the smallest fluctuation scale defined by 
the specific properties of the particles assumed 
to constitute the majority of the CDM. Smaller 
perturbations thus collapse first and then grow 
and merge to form larger and larger objects, 
with details of the evolution determined by ex- 
pansion history of the universe (i.e., by param- 
eters describing the background cosmological 
model) and by the shape of the density fluctua- 
tion power spectrum [4J. 

An example of such evolution in the flat 
ACDM model is illustrated in Figure [1} which 
shows collapse of a « 10^^ Mq object. The fig- 
ure shows that during the early stages of evo- 
lution the matter that is incorporated into the 
final halo collapses into a large number of rel- 
atively small clumps with a filamentary, web- 
hke spatial distribution. Further evolution, me- 
diated by the competition between gravity and 
expansion of space, is a sequence of accretion 
and mergers that builds objects of progressively 
larger mass until the single system is formed 
during the last several billion years of evolu- 
tion. The figure also shows that cores of some 
of the small clumps that merge with and are 
incorporated into larger objects survive[^ until 
later epochs and are present in the form of halo 
substructure or subhalos: small dense clumps 
within virialized regions of larger halos. 

The CDM model of structure formation is re- 

^ Survival of subhalos is not a trivial result and is due to 
a relatively compact distribution of mass in CDM halos. 
Ensuring their survival requires a rather large dynamic 
range in spatial and mass resolution, which had not been 
achieved until the late 1990s ||5,|6ii7J. 



markably successful in explaining a wide range 
of observations from temperature fluctuations 
of the cosmic microwave background tSJ to 
galaxy clustering and its evolution (9) both 
qualitatively and, in many cases, quantitatively. 
Nevertheless, many key details of the model are 
still being developed [fTDl fTTI and its testing is 
by no means complete. 

One area of active investigation is testing 
predictions of the CDM models at scales from 
a few kpc to tens of pc (i.e., the smallest scales 
probed by observations of galaxies). In par- 
ticular, there is still tension between predic- 
tions of the central mass distribution in galax- 
ies [ fT2l [T3|| and sizes and angular momenta 
of galactic disks and observational results [|T4l . 
Notably, this tension has not gone away during 
the past 10-15 years, even though both theo- 
retical models and observations have improved 
dramatically. 

Another example of tension between CDM 
predictions and observations that has been ac- 
tively explored during the last decade is the fact 
that satellite systems around galaxies of difi'er- 
ent luminosity are qualitatively different, even 
though their dark matter halos are expected to 
be approximately scaled down versions of each 
other [15], with their total mass as the scal- 
ing parameter. Faint dwarf galaxies usually 
have no luminous satellites at all. Milky Way 
and Andromeda have a few dozen, but clusters 
of galaxies often have thousands of satellites 
around the brightest cluster galaxy. 

The number of gravitationally-bound satel- 
lite subhalos at a fixed mass relative to the 
mass of their host CDM halo, on the other 
hand, is expected to be approximately the same 
[HH [nl [lEi. This is illustrated in Figure |2| 
which shows distribution of dark matter out 
to approximately two virial radii around the 
centers of two CDM halos of masses different 
by two orders of magnitude. It is clear that 
it is not easy to tell the mass of the halo by 
simply examining the overall mass distribution 
or by counting the number of subhalos. This 



Fig. 1 — Formation of a Milky Way-sized dark matter halo in a cosmological simulation of flat ACDM cosmology 
(Qm = 1 - Qa = 0.3, h - 0.7, erg - 0.9). The panels show an evolutionary sequence at nine redshifts (from left to 
right and from top to bottom) focusing on the most massive progenitor of the main halo at each epoch (redshift of 
each epoch is shown in the left upper corner). The rendering shows the dark matter particles with intensity indicating 
the local matter density on a logarithmic stretch. The build-up of the halo proceeds through a series of spectacular 
mergers, particularly frequent in the early stages of evolution. Many of the merging clumps survive until the present 
epoch (z - 0) in the form of "substructure". The size of the region shown is about 3 comoving Mpc at z - 15, 
monotonically zooming in to a scale of a; 1 comoving Mpc across at z = 0. 



is a visual manifestation of approximate self- 
similarity of CDM halos of different mass. If 
we would compare similar images of distribu- 
tion of luminous matter around galaxies and 
clusters, the difference would be striking. 

The manifestly different observed satelhte 



populations around galaxies of different lu- 
minosities and expected approximately self- 
similar populations of satellite subhalos around 
halos of different mass is known as the sub- 
structure problem. [|19l|20l[T6l. In the case of 
the best studied satellite systems of the Milky 



Fig. 2 — Comparison of two z - Q halos of masses 3 x 10'^ Mq and 3 x 10'^ Mq formed in flat ACDM cosmology 
(Qm = 1 - Oa - 0.3, h = 0.7, cTg = 0.9). In each case the mass distribution around the center of the halo is shown 
to approximately two virial radii from the center of each halo. Both objects were resolved with similar number of 
particles and similar spatial resolution relative to the virial radius of the halo in their respective simulations. I leave it 
as an exercise to the reader to guess the mass of the halo shown in each panel. 



Way and Andromeda galaxies, the discrepancy 
between the predicted abundance of small-mass 
dark matter clumps and the number of observed 
luminous satellites as a function of circular ve- 
locity (see § |2]) has been also referred to as the 
"missing satellites problem.'^ The main goal 
of this paper is to review theoretical and ob- 
servational progress in quantifying and under- 
standing the problem over the last decade. 

2 Quantifying the substructure and luminous 
satellite populations 

In order to connect theoretical predictions and 
observations on a quantitative level, we need 
descriptive statistics to characterize population 
of theoretical dark matter subhalos and ob- 
served luminous satellites. Ideally, one would 
like theoretical models to be able to predict 
properties of stellar populations hosted by dark 

^ The name derived from the title "Where are the miss- 
ing galactic satellites?" of one of the papers originally 
pointing out the discrepancy L20J . 



matter halos and subhalos and make compar- 
isons using statistics involving directly observ- 
able quantities, such as galaxy luminosities. In 
practice, however, this is difficult as such pre- 
dictions require modeling of still rather uncer- 
tain processes shaping properties of galaxies 
during their formation. In addition, the simu- 
lations can reach the highest resolution in the 
regime when complicated and computationally 
costly galaxy formation processes are not in- 
cluded and all of the matter in the universe is 
modeled as a uniform coUisionless and dissi- 
pationles^ component. Such simulations thus 
give the most accurate knowledge of the dark 
matter subhalo populations, but can only pre- 
dict dynamical subhalo properties such as the 
depth of their potential well or the total mass 



That is the component that cannot dissipate the kinetic 
energy it acquires during gravitational collapse and ac- 
companying gravitational interaction and relaxation pro- 
cesses. 
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of gravitationally bound material. Therefore, 
in comparisons between theoretical predictions 
and observations so far, the most common strat- 
egy was to find a compromise quantity that can 
be estimated both in dissipationless simulations 
and in observations. 

2.1 Quantifying the subhalo populations. Start- 
ing with the first studies that made such com- 
parisons using results of numerical simulations 
[|20[ [T6ll the quantity of choice was the maxi- 
mum circular velocity, defined as 



Vmax = max 



Gm(< r) 



1/2 



(1) 



where m(< r) = An J p{r)r^dr is the spheri- 
cally averaged total mass profile about the cen- 
ter of the object, y^ax is a measure of the depth 
of the potential (the potential energy of a self- 
gravitating system is V7 oc Vl^^J and can be 
fairly easily computed in a cosmological sim- 
ulation once the center of a subhalo is deter- 
mined Q The attractive feature of Vmax is that 
it is well defined and does not require estimate 
of a physical boundary of subhalos, which is 
often hard to determine. The price is that res- 
olution required to get the Vmax correctly for a 
subhalo is higher, compared for example to the 
total bound mass of subhalo, because V^ax is 
probing the mass distribution in the inner re- 
gions of halo s. 

The total gravitationally bound mass of a 
subhalo, m^ub, is less sensitive to the resolu- 
tion, but requires careful separation between 
real subhalo particles and unbound particles 



The detailed description of the procedure of identifying 
the centers of subhalos is beyond the scope of this paper, 
but is nevertheless pertinent. While many different algo- 
rithms are used in the Hterature 1211 171 122ll23ll24ll25ll26l . 
all algorithms boil down to the automated search for den- 
sity peaks (most often in configuration space, but some- 
times in the phase space) field smoothed at a scale com- 
parable to or smaller than the size of the smallest sub- 
halos in the simulations. Once the peaks are identified, 
the gravitationally bound material around them is usually 
found by iteratively removing the unbound particles. 



from the diffuse halo of the host dark matter 
halo. This can be quite difficult in the inner re- 
gions of the host system where density of the 
background diffuse halo is comparable to the 
internal density of subhalo or when two subha- 
los overlap substantially. 

An alternative option is to define mass of a 
subhalo within a fixed physical radius. For suit- 
ably chosen radius value, the mass can be mea- 
sured unambiguously both in simulations and in 
observations. We will discuss the measurement 
of the enclosed mass and comparisons between 
simulations and observations below in § |3] and 
§ refsec:models (see Figs.[8]and 12). 

Figure |3] shows the cumulative circular ve- 
locity and mass functions (CVF and CMF) of 
subhalos within the virial radiu^ of a simu- 
lated Milky Way sized halo, formation of which 
was illustrated in Figure [T] Both cumula- 
tive functions can be approximated by power 
laws over the ranges of circular velocity and in 
units of Vmax and virial mass of the host: v = 
Vm«./C^x < 0.1 and/z = m,,b/Mhf < 0.001 
with the slopes of -3.7 ^ -4 and « -0.8 ^ -0.9, 
respectively. At large circular velocities devia- 
tions from the power law can be significant due 
to small numbers of subhalos. 

The highest resolution simulations (to date) 
of the individual MW- sized DM halos formed 
in the concordance ACDM cosmology [|28l |29l 
[30il show that the power laws with the slopes 
in the range indicated above describe the CVF 
and CMF down to // « 10"^ and v « 10 2. 
Note, however, that over a wider range of sub- 
halo masses the power law can be expected 
to change slowly reflecting the changing slope 
of the rms fluctuations as a function of scale, 
which controls the abundance of halos as a 
function of mass OTllBll . 

The amplitude of the mass and velocity func- 



^ Defined as /?a - (3MA/47rAp)'^^, where p is the mean 
matter density in the universe and A = 337, correspond- 
ing to the z = virial overdensity suggested by the spher- 
ical collapse model in the ACDM cosmology 1271 . 
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Fig. 3 — Cumulative circular velocity and mass function 
of subhalos within the virial radius Ry,j,7 - 328 kpc of a 
halo of virial mass M337 = 2 X 10'^ Mo at z = 0. The 
dashed lines show power laws with the slopes indicated 
in the legend of each panel. 

tions is sensitive to the normalization of the 
power spectrum on small scales [l33l |281 and 
is thus sensitive to the cosmological parameters 
that control the normalization (such as tilt and 
normalization erg). 

For a given cosmology, the normalization 
of the CVF and CMF scales approximately 
linearly with the host halo mass [T7l|: A^(> 
lu\Mh) oc Mh. The halo-to-halo scatter in the 
normalization of CVF and MCF for a fixed host 
halo virial mass is described by the Poisson dis- 
tribution IfTTll : cr^(>^) = ■\JN{> ix). The frac- 
tional scatter is therefore quite small for small 
jj. and V (large A^). 

The mass and circular velocity functions 
within a given radius describe the overall abun- 
dance of subhalos of different mass, but not 
their radial distribution. The latter depends 
rather sensitively on how the subhalo samples 



are selected ||34l . This is because subhalos at 
different distances from their host halo center 
on average experience different tidal mass loss, 
which affects different subhalo properties by 
different amount. Subhalo mass is the most af- 
fected quantity as large fraction of halo mass 
when it accretes is relatively loosely bound and 
is usually lost quickly. Although circular veloc- 
ity is determined by the inner mass distribution 
in the inner mass of subhalos, it is still affected 
by tidal stripping (albeit to a lesser degree and 
slower than the total mass f35l). 

The average mass loss experienced by sub- 
halos increases with decreasing distance to the 
center of the host halo [ j34l . Therefore, select- 
ing subhalos based on their current bound mass 
or circular velocity biases the sample against 
subhalos at smaller radii and results in the ra- 
dial distribution much less concentrated than 
the overall mass distribution of the host halo 
dm IMl Ian EH [351 [M El. conversely, one 
can expect that if the selection of subhalos is 
made using a quantity not affected by stripping, 
the bias should be smaller or even disappear al- 
together. 

The top panel of Figure |4] shows the radial 
distribution of subhalos (the same population as 
in Figure |3]) selected using their current bound 
mass or circular velocity and density profile of 
dark matter within a MW- sized sized host halo. 
The figure shows that the subhalo distribution is 
less radially concentrated compared to the over- 
all density profile because selection using cur- 
rent subhalo properties affected by tidal evolu- 
tion biases the sample against the inner regions. 
The bottom panel shows the radial distribution 
of subhalos in the same host halo but now se- 
lected using circular velocity and mass the sub- 
halos had before accretion (which are of course 
not affected by the tides). In this case the ra- 
dial profile is very close to that of the dark mat- 
ter distribution. This dependence of the radial 
profile on the property used for subhalo selec- 
tion should be kept in mind when the observed 
and predicted radial distributions are compared. 
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Fig. 4 — Radial distribution of subhalos (solid lines) 
selected using their different properties (Vmax and total 
mass — solid red and blue lines, respectively) compared 
to the matter density profile in their host halo (dashed 
lines). The upper panel shows the profiles for subhalos 
selected using circular velocity and bound mass a the cur- 
rent epoch, while the lower shows radial distribution of 
subhalos selected using the corresponding quantities be- 
fore subhalo was accreted unaffected by the subsequent 
tidal mass loss. Note that minimum threshold values for 
subhalo selection, jUmin and Vmin, are different in the lower 
panel because for a typical mass loss many subhalos with 
smaller circular velocities and masses at the time of ac- 
cretion fall below the completeness limit of the simula- 
tion by z = 0. 

The latter are selected based on their luminosity 
(i.e., the stellar mass), which may be affected by 
tides much less than either the total bound mass 
or circular velocity 041 [391 . 

Finally, the spatial distribution of satellites 
is not completely spherically symmetric, but is 
triaxial, which reflects their accretion along fil- 
aments and subsequent evolution in the triaxial 
potential of their host halos [,40ll4T1l . 

2.2 Quantifying populations of luminous galac- 
tic satellites. Although we currently know only 



a few dozen of nearby satellite galaxies around 
the Milky Way and Andromeda, these galaxies 
span a tremendous range of the stellar densities 
and luminosities. The two brightest satellites 
of the Milky Way, the Large and Small Magel- 
lanic Clouds (LMC and SMC), are easily visi- 
ble by the naked eye in the southern hemisphere 
and have therefore been known for many hun- 
dreds of years, while the faintest satellites have 
been discovered only very recently using so- 
phisticated search algorithms and the vast data 
sets of stellar photometry in the Sloan Digi- 
tal Sky Survey and contain only a few hundred 
stars [42l,l43l|. Up until late 1990s only a dozen 
dwarf galaxies were known to exist within 300 
kpc of the Milky Way, with a similar num- 
ber around the Andromeda [14411 . These galax- 
ies have luminosities L > 10^ L© and mor- 
phologies of the three types: 1) dwarf irreg- 
ular galaxies (dh-rs, e.g., LMC and SMC) — 
low surface brightness galaxies of irregular ap- 
pearance which have substantial amount of gas 
and thus exhibit continuing star formation, 2) 
dwarf spheroidal galaxies (dSphs, e.g., Draco 
or Fornax) — low surface brightness galax- 
ies with spheroidal distribution of stars and no 
(or very little) ongoing star formation and 3) 
dwarf elliptical galaxies (dEs, e.g. M32) — 
high-surface brightness, low-luminosity ellip- 
ticals with no gas and no current star forma- 
tion. dSph and dE galaxies tend to be located 
within 200 kpc of their host galaxies, while dirr 
galaxies are distributed more uniformly. This 
tendency is called the "morphological segrega- 
tion" ([44J) and appears to exist in other nearby 
groups of galaxies [[45|[ . The properties of these 
"classical" dwarf galaxies are reviewed exten- 
sively by [44] (see also recent study of scaling 
relations of dwarf galaxies by [46J . 

Despite the wide range of observed proper- 
ties, all of the nearby dwarfs share some com- 
mon features in their star formation histories 
(SFHs). The SFHs of all classical dwarfs is 
characterized by a rather chaotically varying 
star formation rates. Most bright dwarfs form 




Fig. 5 — The distance to which the current samples of 
dwarf satelhte galaxies around the Milky Way are com- 
plete as a function of galaxy luminosity (absolute V-band 
magnitude on the bottom scale and physical luminosity 
in units of solar luminosity at the top scale). Note that 
the completeness distance of the faintest recently discov- 
ered satellites is < 50 kpc. Adopted from L5 1 J . 

Stars throughout their evolution, although the 
majority of stars may be formed in several main 
episodes spread over ten or more billion years 
PTl SHI Hil [501, and have at least some frac- 
tion of stars that formed in the first two billion 
years of the evolution of the universe. In terms 
of their SFHs, the main difference between the 
dwarf irregular and dwarf spheroidal galaxies is 
in the presence or absence of star formation in 
the last two billion years [|49ll . 

The radial distribution of the classical satel- 
lites around the Milky Way is rather compact. 
For the dwarf galaxies within 250 kpc of the 
Milky Way, the median distance to the cen- 
ter of the Galaxy is « 70 kpc [|35l |52l, while 
the predicted median distance for subhalos is 
X 120 - 140 kpc [35| (see Figure [T3] below). 



The distribution of the satellites around the 



Fig. 6 — The luminosity function of dwarf galaxies 
around the Milky Way. The function includes all ob- 
served galaxies within 417 kpc of the Milky Way. The 
red circles connected by the dotted line show the lu- 
minosity function as observed, without any corrections 
for incompleteness. The green circles connected by the 
dashed line show the observed luminosity function cor- 
rected for limited coverage of surveys on the sky. The 
blue circles connected by the solid line show luminos- 
ity function corrected for the radial bias using radial dis- 
tribution of subhalos from the Via Lactea I simulation. 
Adopted from \5T\. 

Andromeda galaxies is consistent with that of 
the Milky Way satellites, but is less accurately 
determined due to larger errors in distances. 
The spatial distribution of satellites about the 
Milky Way and Andromeda is also manifestly 
non-isotropic with the majority of the satellites 
found in a flattened structure nearly perpendic- 
ular to the disk [53] [51 |56l |57l . 

In 1994, a new faint galaxy was discov- 
ered in the direction toward the center of the 
Galaxy (in the Sagittarius constellation, [[SSlD . 
The galaxy is similar to other nearby dwarf 
spheroidal galaxies in its properties but is re- 
markably close to the solar system (the distance 



of only « 28 kpc) and is in the process of be- 
ing torn apart by the tidal interactions with the 
Milky Way. This interaction has produced a 
spectacular tidal tail, which has wrapped sev- 
eral times along the orbit of the Sagittarius 
dwarf 11591 ■ 

The discovery of this new satellite has alerted 
researchers in the field to the possibility that 
other satellites may be lurking undiscovered in 
our cosmic neighborhood. The advent of wide 
field photometric surveys, such as the Sloan 
Digital Sky Survey and the targeted surveys 
of regions around the Andromeda galaxy, and 
new search techniques has resulted in dozens of 
new satellite galaxies discovered during the last 
decade (Sol ED SI [6l |63l IM ISl [Ml 111 with 
many more discoveries expected in the near fu- 
ture [[STi |67l . The majority of the newly dis- 
covered galaxies are fainter than the "classical 
dwarfs" known prior to 1998. Due to their ex- 
tremely low luminosities (as low as ~ 1000 Lq 
in the case of the Segue 1 [|43ll ') they have col- 
lectively been referred to as the "ultra-faint" 
dwarfs. Such low luminosities (and implied 
stellar masses) indicate an extreme mode of 
galaxy formation, in which the total popula- 
tion of stars produced during galaxy evolution 
is smaller than a star cluster formed in a single 
star formation event in more luminous galaxies. 

More practically, the extreme faintness of the 
majority of dwarf satellites implies that we have 
a more or less complete census of them only 
within the volume of ~ 30 - 50 kpc of the 
Milky Way BM |5ll- Figure [5] shows the dis- 
tance to which the dwarfs of a given luminos- 
ity are complete in the SDSS survey, in which 
the faintest new dwarfs have been discovered. 
The figure shows that we have a good census 
of the volume of the Local Group only for the 
relatively bright luminosities of the "classical" 
satellites. At the fainter luminosities of the ul- 
tra faint dwarfs, on the other hand, we can ex- 
pect to find many more systems at larger radii 
in the future deep wide area surveys. The ex- 
act number we can expect to be discovered de- 



pends on their uncertain radial distribution, but 
given the numbers of already discovered dwarfs 
and our current knowledge of the radial distri- 
bution of brighter satellites (and expected radial 
distribution of subhalos) we can reasonably ex- 
pect that at least a hundred faint satellites exist 
within 400 kpc of the Milky Way. This is illus- 
trated in Figure |6| which shows the luminosity 
function of the Milky Way satellites corrected 
for the volume not yet surveyed under difi'er- 
ent assumptions about radial distribution of the 
satellites 1511 . 

The basis for considering these extremely 
faint stellar systems as bona fide galaxies is 
the fact that, unlike star clusters, they are dark 
matter dominated: i.e., the total mass within 
their stellar extent is much larger than the stellar 
mass expected for old stellar populations [43]. 
The total dynamical masses of these galaxies 
are derived using kinematics of stars High 
resolution spectroscopy of the red giant stars in 
the vicinity of each galaxy provides the radial 
velocities of these stars. The radial velocities 
can then be modeled using using the Jeans equi- 
librium equations to derive the total mass pro- 
file [69] ITUl El d |73l IH] . This modeling re- 
quires certain assumptions about the unknown 
shape of the stellar distribution and velocity dis- 
tribution of stars, as well as assumptions about 
the shape and radial profile of the dark matter 
distribution. The resulting mass profile there- 
fore has some uncertainty associated with these 
assumptions Il69ll72ll74l . 

Additionally, the ultra-faint dwarfs follow 
scaling relations of the brighter classical satel- 
lites such as the luminosity-metallicity relation 
[TTSI and therefore seem to be the low luminos- 
ity brethren within the family of dSph galaxies. 



These faint dwarf spheroidal galaxies do not have cold 
gas and therefore their mass profiles cannot be measured 
using the gas rotation curve, as is commonly done for 
more massive dirr galaxies. 



3 Defining the substructure problem. 

As I noted above, comparison of theory and ob- 
servations in terms of the directly observable 
quantities such as luminosities is possible only 
using a galaxy formation model. These mod- 
els, although actively explored ( [r76ll77l l78l[35l 
I73IHUIIII1IE3, see also §143]) are considerably 



^ I did not include the new ultra-faint satellites in the 
comparison both because their V^ax values are much 
more uncertain and because their total number within 
the virial radius requires uncertain corrections from the 
currently observed number that probes only the nearest 
few dozen kpc. The velocity dispersions of the ultra- 
faint dwarfs are very similar to each other (~ 5 km/s) and 
they therefore formally have similar Vmux values accord- 
ing to this simple conversion method (hence, they would 
all be "bunched up" at about the same V^-dx ~ 9 km/s 
value). The maximum circular velocity of the halos of 
these galaxies is expected to be reached at radii well 
beyond the stellar extent and its estimate from the ob- 
served velocity dispersions requires substantial extrapo- 
lation and assumptions about the density profile outside 
the radii probed by stars. The errors of the derived values 
of Vmax can therefore be quite substantial |i69i|83J. 1 will 
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more uncertain than the predictions of dissipa- 
tionless simulations on the properties of dark 
matter subhalos. Given that observed dwarf 
satellites are very dark matter dominated, the 
dissipative processes leading to formation of 
their stellar component are expected to have a 
limited effect on the distribution of the dynam- 
ically dominant dark matter. Fruitful compari- 
son between simulation predictions and obser- 
vations is therefore possible if a quantity related 
to the total mass profile can be measured in the 
latter. 

The first attempts at such comparisons [r71[T6l 
assumed isotropy of the stellar orbits and con- 
verted the line-of- sight velocity dispersion of 
stars in dSph satellites, cr,-, to estimate their 
maximum circular velocities as V^ax = ^CTr- 
The admittedly over-simplistic conversion was 
adopted simply due to a lack of well measured 
velocity profiles and corresponding constraints 
on the mass distribution at the time. Figure |7] 
shows such a comparison for the classical satel- 
hte^ of the Milky Way and subhalo popula- 
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Fig. 7 — Comparison of the cumulative circular velocity 
functions, A^(> Vmax). of subhalos and dwarf satellites of 
the Milky Way within the radius of 286 kpc (this radius 
is chosen to match the maximum distance to observed 
satellites in the sample and is smaller than the virial ra- 
dius of the simulated halo, R^^j = 326 kpc). The subhalo 
VFs are plotted for the host halos with max. circular ve- 
locities of 160 km/s and 208 km/s that should bracket the 
Vmax of the actual Milky Way halo. The VP for the ob- 
served satellites was constructed using circular velocities 
estimated from the line-of-sight velocity dispersions as 
Vmax - ^cTr (scc discussion in the text for the uncer- 
tainties of this conversion). 

tions in Milky Way-sized halos formed in the 
concordance ACDM cosmology. 

The observed velocity function is compared 
to the predicted VF of dark matter subhalos 
within a 286 kpc radius of Milky Way-sized 
host halos. In the literature, "Milky Way-sized" 
is often used to imply a total virial mass of 
Mvir ~ 10'^ Mq and maximum circular veloc- 
ity of Vmax * 200 km/s. However, there is 
some uncertainty in these numbers. Therefore 
the figure shows the VFs for the host halos with 
Kiax = 208 km/s and 160 km/s. The former is 
measured directly in a simulation of the halo 
of that circular velocity, while the latter VF 



compare the predicted luminosity function of the lumi- 
nous satellites using a simple galaxy formation model in 



§ 4.3 (see Fig. 
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was rescaled as N{> Vmax) ■^vir,i/-^vir,2 = 
(Vmax,i/V'max,2)^'^, using scaling measured statis- 
tically in the simulations [17J. 

The simple conversion of cr^ to Vmax has 
justly been criticized as too simplistic [[84]| . In- 
deed, the conversion factor 77 = Vmax/cTr re- 
quires a good knowledge of mass profile from 
small radii to the radius r^ax- The mass pro- 
file derived from the Jeans equation has errors 
associated with uncertainties in the anisotropy 
of stellar orbits, as well as with uncertainties of 
spatial distribution of stellar system and/or its 
dynamical state f SSHTOll . Most importantly, the 
mass profile is only directly constrained within 
the radius where stellar velocities are measured, 
r*. If this radius is smaller than rmax, conver- 
sion factor 77 depends on the form of the density 
profile assumed for extrapolation. The uncer- 
tainties of the derived mass profile within the 
stellar extent will of course also be magnified 
increasingly with increasing r^aax/r* ratio ll85l . 

Thus, for example, Stoehr et al. ri84ll have 
argued that the conversion factor can be quite 
large (7/ > 2 - 4) if the density profile is shal- 
low in the inner regions probed by the stars. 
Such large conversion factor would shift the 
low Vynax points in Figure |7] to the right closer 
to the subhalo VF [IMIHlllTl and would imply 
that there is a sharp drop in the luminosity func- 
tion of satellites below a certain threshold circu- 
lar velocity {Vmax ~ 30 km/s). High-resolution 
simulations of individual satellites, on the other 
hand, have demonstrated that the CDM satel- 
lites retain their cuspy inner density profiles, 
even as they undergo significant tidal stripping 
[|87]| . This implies that rj is likely not as large 
as advocated by Stoehr et al. The main uncer- 
tainty in its actual value for a specific satellite is 
then due to the uncertainty in the density profile 
and ratio r^axlr*. 

Recently, Peiiarrubia et al. [1721 combined 
the measurements of stellar surface density and 
ctXR) profiles to estimate y^ax values for indi- 
vidual observed MW satellites under assump- 
tion that their stellar systems are embedded into 



NFW dark matter potentials. Such procedure 
by itself does not produce a reliable estimate 
of Vmax, because NFW potentials with a wide 
range of Vmax can fit the observed stellar pro- 
files. To break the degeneracy Peiiarrubia et 
al. have used the relation between Vmax and 
Vax expected in the concordance ACDM cos- 
mology. They showed that this results in es- 
timates of Vmax which imply 77 « 2 - 3 (see 
their Fig. 4). This estimate does not take into 
account effects of tidal stripping on the evolu- 
tion of the Vax - Kiax relation. Typically, sub- 
halos located in the inner regions of the halo 
are expected to have lost ~ 60 - 90% of their 
initial mass by z = due to tidal stripping 
[|35l|34l[Ml- For such mass loss V^ax changes 
only by 20 - 30% [[351 HI but Vax should 
change by a factor of ~ 2-3 [I87ll39l. 

In a subsequent study, Peiiarrubia et al. [|39| 
used controlled simulations of subhalo evolu- 
tion to argue that tidal stripping does not sig- 
nificantly affect their inferred conversion factor 
77 (see their Fig. 9). This conclusion, however, 
was drawn based on the systems in which both 
stellar system and DM halo were significantly 
stripped. In such system r^ax is close to the stel- 
lar radius and ctq and y^ax evolve in sync. For 
systems with more realistic mass loss and with 
stars deeply embedded within r^ax, however, 
stellar system (and ctq) may not be affected, 
while Vmax can evolve significantly. For such 
systems the method of [17211 will lead to a sig- 
nificant overestimate of 77. Indeed, the systems 
in Fig. 9 of [39J for which the method overesti- 
mates 77 (by a factor of ~ 1 .4) the most are the 
systems with moderate total mass loss and least 
affected stellar systems. Note that even these 
systems have likely experienced more tidal loss 
than most of the real dSph satellites. 

Another factor in estimates of V^ax is 
anisotropy of stellar velocities in dSph (e.g., 
WT\ ). For example, recent analysis of observed 
velocity dispersion profiles of "classical" dSph 
by fT3\, in which the anisotropy of stellar or- 
bits was treated as a free parameters, results in 
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Fig. 8 — The mass function of dwarf satellites of the 
Milky Way, where masses of subhalos and observed 
satellites are measured within a fixed physical radius of 
0.6 kpc. Adopted from I.85J . 

estimates of Vmax of their host subhalos in the 
range ~ 10-25 km/s, smaller than would be 
suggested if correction factor was r] ^ 2 - 3. 

Regardless of the actual conversion factor 
value, however, it is clear that it cannot change 
the main difference between the observed and 
predicted VFs - the large difference in their 
slope - unless rj strongly depends on V^ax (there 
is no observational evidence for this so far). 

Another promising approach is to abandon 
attempts to derive V^ax altogether and to mea- 
sure instead the observed mass within the ra- 
dius where the uncertainty of the measured 
mass profile is minimal. Such radius is close to 
the stellar extent of observed galaxies [|85l [83l 
El d. Figure [8j adopted from MB, shows 
comparison of the mass functions of subha- 
los in the Via Lactea I simulation [89| and 
observed satellites of the Milky Way, where 
masses are measured within a fixed physical 



radius of 600 pc (see also discussion in § 4.3 



and Fig. 12). The figure shows that the simu- 



lated and observed mass functions are different, 
the conclusion similar to that derived from the 



comparison of the circular velocity functions. 

Thus, the discrepancy that is clearly seen 
in the comparison of circular velocity func- 
tions, measured with more uncertainty in ob- 
servations, persists if the comparison is done 
using a much better measured quantity. Un- 
fortunately, the stellar distribution in most of 
the newly discovered ultra-faint dwarf galax- 
ies does not extend out to 600 pc radius and 
m(r < 0.6 kpc) = mo.e therefore cannot be mea- 
sured as reliably for these faint systems as for 
the classical dwarfs. Similar comparisons have 
to be carried out using masses within smaller 
radii. This puts stringent requirements on the 
resolution of the simulations, as they need to 
reliably predict mass distribution of subhalos 
within a few hundred parsec radius. Such high- 
resolution simulations are now available, how- 
ever f28l|29l [301. 

We can draw two main conclusions from the 
comparisons of the circular velocity functions 
and the more reliable mo.e mass functions of 
subhalos and observed satellites presented in 
the previous sections, even taking into account 
existing uncertainties in deriving circular veloc- 
ities and the total dynamical masses for the ob- 
served satellites. First, the predicted abundance 
of the most luminous satellites is in reasonable 
agreement with the data, even though the statis- 
tics are small. Most MW-sized halos simulated 
in the concordance ACDM cosmology have 1- 
2 LMC sized (V^ax « 60 - 70 km/s) subhalos 
within their virial radius. 

This is not a trivial fact because the abun- 
dance of the most massive satellites is deter- 
mined by a subtle interplay between the accre- 
tion rate of systems of corresponding circular 
velocity and their disruption by the combined 
effects of the dynamical friction and tidal strip- 
ping [|90l . Dynamical friction causes satellites 
to sink to the center at a rate which depends 
on the mass and orbital parameters of satellite 
orbit. Orbital parameters, in turn, depend on 
the cosmological environment of the accreting 
host halo and are mediated by the tidal strip- 
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ping which reduces satellite mass as it sinks, 
thereby rendering dynamical friction less effi- 
cient BmiHillllll. The fact that the concor- 
dance ACDM model makes an ab initio predic- 
tion that the number of massive satellites that 
can host luminous dwarfs is comparable to ob- 
servations can therefore be viewed as a success 
of the model. 

Second, the slopes of both the circular ve- 
locity function and the mo.6 mass function are 
different in simulations and observations. This 
implies that we cannot simply match all of 
the luminous satellites to the subhalos with the 
largest Vmax and mo.6, as was sometimes advo- 
cated (841 [86l. The mo.6 mass function compar- 
ison, in particular, indicates that there should be 
some subhalos with the mo g ~ 10^ Mq that do 
not host the luminous galaxies, and some that 
do. As I discuss in § 4.2[ this has a strong im- 
plication for the physical interpretation of the 
difference in terms of galaxy formation scenar- 
ios. 

In summary, the substructure problem can 
be stated as the discrepancy in the slopes of 
the circular velocity and mo.g mass functions in- 
ferred for observed satellites of the Milky Way 
and the slopes of these functions predicted for 
dark matter subhalos in the MW-sized host ha- 
los formed in the concordance ACDM cosmol- 
ogy. 

I believe that stated this way the problem is 
well-defined. Defining the problem in terms of 
the difference in the actual number of satellites 
and subhalos is confusing at best, as both num- 
bers are fairly strong functions of subhalo mass 
or stellar luminosity. Thus, for example, even 
though the discovery of the ultra-faint dwarfs 
implies the possible existence of hundreds of 
them in the halo of the Milky Way [49,75] (this 
fact has been used to argue that the substructure 
problem has been "alleviated"), the most recent 
simulations show that more than 100,000 sub- 
halos of mass msub > 10^ Mq should exist in the 



Milky Way [|28l IHQ The substructure prob- 
lem stated in the actual numbers of satellites is 
therefore alive and well and has not been alle- 
viated in the least. 

I would like to close this section by a brief 
discussion of the comparison of spatial distri- 
bution of observed satellites and subhalos. As 
I noted above, the radial distribution of the ob- 
served satellites of the Milky Way is more com- 
pact than the radial distribution of subhalos se- 
lected using their present day mass or circular 
velocity [l35l[52ll97ll . In addition, the observed 
satellites are distributed in a quite flattened 
structure with its plane almost perpendicular to 
the disk of the Milky Way [|53l IM EH |56l |53 • 
Although the spatial distribution of all subha- 
los is expected to be anisotropic, reflecting the 
anisotropy of their accretion directions along 
filaments [|40l |4T1| and, possibly, the fact that 
some of the satellites could have been accreted 
as part of the same group of galaxies it is 
not as strong as the anisotropy of the observed 
Milky Way satellites. 

Thus, both the radial distribution and 
anisotropy of the observed satellites do not 
match the overall distribution of subhalos in 
CDM halos. This is likely another side of the 
same substructure problem coin and the overall 
spatial distribution of observed satellites needs 
to be explained together with the differences in 
the circular velocity function. I will review a 
few possible explanations for the substructure 
problem and differences in the spatial distribu- 
tion in the next section. 

4 Possible solutions 

4.1 Modifications to the CDM model. One pos- 
sible way to account for the differences of the 



Indeed, it is not obvious a priori that subhalos of mass 
10^ - 10^ Mq are too small to host luminous stellar sys- 
tems of stellar mass M. ~ 10^ - 10"* Mq f95l — the 
stellar masses corresponding to the luminosities of the 
faintest recently discovered dwarfs. After all, the halos 
of this mass are expected to be hosting formation of the 
very first stars h96i . 
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predicted and observed circular velocity and 
mo.6 mass functions is to assume that ACDM 
model is incorrect on the small scales probed 
by the dwarf galactic satellites. Indeed, the 
abundance of satellites is sensitive to the ampli- 
tude of the power spectrum on the scales corre- 
sponding to the total mass of their host halos. 
For a halo of mass M the comoving scale of 
fluctuations that seed their formation is 
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where Q.^ is the present-day total matter den- 
sity in units of the present-day critical density, 
Pcrito = 3H^/SnG and Hq is the current Hubble 
constant in units of km/s/Mpc. 

If the amplitude of density fluctuations on 
such scales is considerably suppressed com- 
pared to the concordance ACDM model used 
in most simulations, the abundance of subha- 
los can then also be suppressed. Such suppres- 
sion can be achieved either by suitably vary- 
ing parameters controlling the amplitude of the 
small-scale power spectrum within the ACDM 
model itself [33], such as the overall normal- 
ization of the power spectrum or its large-scale 
tilt, or by switching to models in which the 
amplitude at small scales is suppressed, such 
as the warm dark matter (WDM) structure for- 
mation scenarios 1991 [M [Ml |33l [Mil. In 
these models the abundance of satellites is sup- 
pressed both because fewer halos of dwarf mass 
form in the first place (due to smaller initial am- 
plitude of fluctuations) and because halos that 
do form have a less concentrated internal mass 
distribution, which makes them more suscep- 
tible to tidal disruption after they accrete onto 
their host halo. Models in which dark matter 
was assumed to be self-interacting, a property 
that can lead to DM evaporation, have also been 
proposed and discussed I1103[ I104L but these 
models both run into contradiction with other 



observational properties of galaxies and clus- 
ters [[T05l[T06l[T071[T08]l and are now strongly 
disfavored by observational evidence indicating 
that dark matter self-interaction is weak H 10911 . 

The problem, however, is more subtle than 
simply suppressing the number of satellites. As 
discussed above, diff"erences exist between ob- 
served and predicted slopes of the circular ve- 
locity and mass. The slope is controlled by 
the slope of the primordial fluctuation spectrum 
around the scale corresponding to the masses 
of satellite halos and structural properties of the 
forming halos. It has not yet been demonstrated 
convincingly whether both the circular veloc- 
ity and the mo.e mass functions can be repro- 
duced in any of the models alternative to CDM. 
In fact, recent measurements of mass distribu- 
tion in the central regions of observed satel- 
lites put stringent constraints on the phase space 
density and "warmness" of dark matter [69]. 
At the same time, measurements of the small- 
scale density power spectrum of the Lyman <x 
forest indicate that fluctuations with the ampli- 
tude expected in the ACDM model at the scales 
that control the abundance of dwarf mass halos 
are indeed present in the primordial spectrum 

ttnoiiiiD. 

While the inner density distribution in ob- 
served satellites may still be afl^ected by dark 
matter warmness in the allowed range of pa- 
rameter space [1711 (see, however, iTTSlI ). the 
models with such parameters would not sup- 
press the overall abundance of satellites con- 
siderably. In fact, observations of flux ratios in 
the multiple image radio lenses appear to re- 
quire the amount of substructure which is even 
larger than what is typically found in the CDM 
halos nil2[|113[[TT4| . disfavoring models with 
strongly suppressed abundance of small mass 
subhalos. 

There is thus no compelling reason yet to 
think that the observed properties of galactic 
satellite populations are more naturally repro- 
duced in these models. In the subsequent dis- 
cussion, I will therefore use Occam's razor and 
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focus on the possible explanations of the differ- 
ences between observed satellites and subhalos 
in simulations within the ACDM model. The 
prime suspect in producing the discrepancy is 
the still quite uncertain physics of galaxy for- 
mation. After all a similar problem exists for 
objects of larger masses and luminosities if we 
compare the slope of the luminosity function 
and the halo mass function 111151 1116II or the 
predicted and observed abundance of galaxies 
in the nearby low density "field" regions [I117L 

4.2 Physics of galaxy formation. Several plau- 
sible physical processes can suppress gas ac- 
cretion and star formation in dwarf dark mat- 
ter halos. The cosmological UV background, 
which reionized the Universe at z > 6, heats 
the intergalactic gas and establishes a character- 
istic time-dependent minimum mass for halos 
that can accrete gas [M [HOl [Ml [HI 
[T231 [T241 [T251I . The gas in the low-mass ha- 
los may be photoevaporated after reionization 
lfT26l [TiTl [HI or blown away by the first gen- 
eration of supemovae [fT29l[nol[T3n[T32ll (see, 
however, 111331 ). At the same time, the ion- 
izing radiation may quickly dissociate molecu- 
lar hydrogen, the only efficient coolant for low- 
metallicity gas in such halos, and prevent star 
formation even before the gas is completely re- 
moved I1134II . Even if the molecular hydrogen 
is not dissociated, cooling rate in halos with 
virial temperature^ Tvir < 10"^ K is consider- 
ably lower than in more massive halos [I135II 
and we can therefore expect the formation of 
dense gaseous disks and star formation sup- 
pressed in such halos. Another potential galaxy 
formation suppression mechanism is the gas 
stripping effect of shocks from galactic out- 
flows and cosmic accretion Ill36[|137l l. Finally, 



^ The virial virial temperature Tvir is related to the virial 
mass by kT^{^ - ^fiinpGMyiJRyij, where isothermal tem- 
perature profile is assumed for simplicity. The virial 
mass and radius are related by definition as Mvii = 
47r/3 X Avirp/?''-.. Assuming Ayir - 178 appropriate 
for z > 1 regardless of Qq, this gives Mvii ~ 5.63 x 



even if the gas is accreted and cools in small- 
mass halos, it is not guaranteed that it will form 
stars if gas does not reach metallicities and sur- 
face densities sufficient for efficient formation 
of molecular gas and subsequent star formation 
[|35l [T38l[T39l[T40l[T4T]| . 

The combined effect of these processes is 
likely to leave most of dark matter halos with 
masses < few x 10^ Mq dark, and could have 
imprinted a distinct signature on the properties 
of the dwarf galaxies that did manage to form 
stars before reionization. In fact, if all these 
suppressing effects are as efficient as is usu- 
ally thought, it is quite remarkable that galax- 
ies such as the recently discovered ultra-faint 
dwarfs exist at all. One possibility extensively 
discussed in the literature is that they man- 
aged to accrete a certain amount of gas and 
form stars before the universe was reionized 
[lia |77l |78l [Iia 1113 [IM Ell El. Direct 
cosmological simulations do show that dwarf 
galaxies forming at z > 6 bear striking resem- 
blance to the faint dwarf spheroidal galaxies or- 
biting the Milky Way [T46l [T47l and their 
predicted abundance around the Milky Way is 
consistent with estimates of the abundance of 
the faintest satellites [I142L Alternatively, some 
authors argued (HI [T48l [861 [Till that observed 
dwarf satellite galaxies could be in much more 
massive subhalos than was indicated by sim- 
ple estimates of dynamical masses and circular 
velocities using stellar velocity dispersions. In 
this case, the relatively large halo mass could 
allow an object to resist the suppressing effects 
of the UV background. 

Cosmological simulations also clearly show 
that the subhalos found within virial radii of 
larger halos at z = have on average lost signif- 
icant amount of mass and have been consider- 
ably more massive in the past [[35l l34l [88l . The 
dramatic loss of mass occurs due to the tidal 
forces that subhalos experience as they orbit in 
the potential of the host. A significant frac- 
tion of the luminous dwarf satellites therefore 
can be associated with those subhalos that have 
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been substantially more massive in the past and 
hence more resilient against galaxy formation 
suppressing processes. Such subhalos could 
have had a window of opportunity to form their 
stellar systems even if the subhalos they are 
embedded in today have relatively small mass 

m. 

4.3 Models for luminous satellite population. 

Given the galaxy formation suppression mech- 
anisms and evolutionary scenarios listed above, 
the models aiming to explain the substructure 
problem can be split into the following broad 
classes: 1) the "threshold galaxy formation 
models" in which luminous satellites are em- 
bedded in the most massive subhalos of CDM 
halos and their relatively small number indi- 
cates the suppression of galaxy formation in 
subhalos of circular velocity smaller than some 
threshold value UHl [M 123 and 2) "selective 
galaxy formation models " in which only a frac- 
tion of small subhalos of a given current Vmax 
and mass host luminous satellites while the rest 
remain dark. 

In the second class of models the processes 
determining whether a subhalo hosts a lumi- 
nous galaxy can be the reionization epoch [1761 
[T451 [T431 [T491 [IM [H3: subhalos that assem- 
ble before the intergalactic medium was heated 
by ionized radiation become luminous. The ob- 
served faint dwarfs can then be the "fossils" of 
the pre-reionization epoch H 1451 11461 . Subhalo 
may also form a stellar system if its mass as- 
sembly history was favorable for galaxy forma- 
tion [35 J : namely, luminous subhalos are those 
that have had sufficiently large mass during a 
period of their evolution to allow them to over- 
come the star formation suppression processes. 

Several models using a combination of the 
processes and scenarios outlined above have 
been shown to reproduce the gross properties 
of observed population of satellites reasonably 
well OSl IM [HIl m. How can we test differ- 
ent classes of models and differentiate between 
specific ones? 



First, I think the fact the mo.g mass func- 
tion for observed satellites has a different slope 
compared to simulation predictions (Fig. [8]) fa- 
vors the second class of the selective galaxy for- 
mation models, at least for the brighter "clas- 
sical" satellites. Indeed, given what we know 
about the average mass loss of subhalos, it 
is more natural to associate the observed sys- 
tems with the halos of the largest mass prior 
to accretion [|35| rather than with the subha- 
los with the largest current masses. Second, 
the predicted number of the weakly evolving 
pre-reionization objects ni42[ 11441 1791 . the ex- 
tended star formation histories of most of the 
observed dwarf satellite^] [fTSOl I48l|49|l50l. and 
the significant spread in metallicities and cer- 
tain isotope ratios [I151II indicate that majority 
of "classical" dwarfs have not formed most of 
their stars before reionization but have formed 
their stars over rather extended period of time 
(~ 10 Gyr). It is still possible, however, that a 
sizable fraction of the ultra-faint dwarfs are the 
"pre-reionization fossils" ffT42l[T52l[T46l[T53]l . 
if star formation efficiency in these objects is 
greatly suppressed [|79ll compared to that of 
brighter dwarfs. 

Interesting additional clues and constraints 
on the galaxy formation models available for 
the dwarf satellites of the Milky Way are 
the measurements of the total dynamical mass 
within their stellar extent. Observations show 
that the total masses within a fixed aperture 
of the observed satellites are remarkably sim- 
ilar despite a several order of magnitude span 
in dwarf satellite luminosities [|154[ HH |155[ 
1721 [83|| . For example, the range of masses 
within 0.6 kpc shown in Figure [8] is only an 
order of magnitude. Furthermore, existence of 
the tight correlation between total dynamical 
mass within the half-light radius, M(rhaif), and 



In fact, in terms of star formation the main difference 
between the dhr and dSph galaxies appears to be pres- 
ence or lack of star formation in the last 2 billion years 
before z ^ Il49ll. 
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Fig. 9 — The mass within central 300 pc vs the total 
virial mass of an NFW halo predicted using the con- 
centration mass relation c(Mvir) at different redshifts in 
the concordance WMAP 5-year best fit cosmology. The 
horizontal dotted lines indicate the range of moj masses 
measured for the Milky Way dwarf satellites. 

the corresponding radius rhaif [TTSl for bright 
dSph galaxies implies a very similar inner dark 
matter density profile of their host halos. Re- 
cently, L. Strigari and collaborators [83] have 
shown that the mass estimated within the cen- 
tral 300 pc, m(< 0.3 kpc) = mo.3, for all of the 
dwarfs with kinematic data varies by at most a 
factor of four, while the luminosity of the galax- 
ies varies by more than four orders of magni- 
tude. 

These observational measurements put con- 
straints on the range of masses of CDM sub- 
halos that can host observed satellites. To esti- 
mate this range, we should first note that for the 
CDM halos described by the NFW profile tlSj] 
with concentration c = Rm/rs (where is the 
scale radius - the radius at which the density 
profile has logarithmic slope of -2) the depen- 
dence of the mass within a fixed small radius 
X = r/7?vir on the total virial mass is 

f(cx) 

m(< x) = Myi,——, where (3) 
/(c) 

fix) ^ ln(l +x)--^ (4) 
I + X 



and is quite weak for r = 300 pc: mo.3 
^vir shown in Figure |9] Indeed, even 

for a halo with the Milky Way mass at z = 
we expect mo.3 * 4 x 10' M©, a value not too 
diff"erent from those measured for the nearby 
dwarf spheroidals. Physically, the weak depen- 
dence of the central mass on the total mass of 
the halo reflects the fact that central regions of 
halos form very early by mergers of small-mass 
halos. Given that the rms amplitude of density 
perturbations on small scales is a weak function 
of scale, the central regions of halos of difi'erent 
mass form at a similar range of redshifts and 
thus have similar central densities reflecting the 
density of the universe when the inner region 
was assembled. At earlier epochs the depen- 
dence is stronger because 300 pc represents a 
larger fraction of the virial radius of halos. 

Note that the relation plotted in Figure [9] is 
for isolated halos unaff'ected by tidal stripping. 
Taking into account efl"ects of tidal stripping re- 
sults in even flatter relation liSTI : mo.3 °^ M'^'^^, 
which also has a lower normalization (smaller 
mo.3 for a given Myir- This is likely due to 
a combination of two effects: 1) the halos of 
larger mass have lower concentrations and thus 
can be stripped more efficiently and 2) the ha- 
los of larger mass can sink to smaller radii af- 
ter they accrete and experience relatively more 
tidal stripping. Overall, the effect of stripping 
on mo.3 appears to be substantial and cannot be 
neglected. 

Finally, figure |9] shows that the virial mass 
range corresponding to a given range of mo.3 is 
quite different for halos that form at z > 2 com- 
pared to those that form at later epochs due to 
the rapidly evolving NFW concentration con- 
centration for a fixed halo mass I1156II . The 
mass mo.3 can therefore be only interpreted in 
the context of a model for subhalo evolutionary 
histories. 

Several recent studies have used such mod- 
els to show that the nearly constant central 
mass of the satellite halos is their natural out- 
come JSOl [8ll UM- This outcome can be un- 



derstood as a combination of the weakness of 
the mo.3 - Mvir correlation and the fact that in 
the galaxy formation models galaxy luminosity 
L must be a nonlinear function of Mvir in order 
to produce a faint-end slope of the galaxy lumi- 
nosity function much steeper than the slope of 
the small-mass tail of the halo mass function. 

For example, if the faint-end slope of the lu- 
minosity function is ^ (i.e., dn(L)/dL oc L^) 
and the slope of the halo mass function at small 
mass end is ^ {dn{M)/dM oc M^) and we as- 
sume for simplicity a one-to-one monotonic 
matching between galaxies and halos n(> L) = 
n{> M) (see [1571 for the detailed 

justification for such assumption), the implied 
slope of the L-M^i, relation isyS = (1 -l-^)/(l -i-^), 
which for the fiducial values of ^ « -2 and 
^ « -1.2 gives /3 « 5. In semi-analytic mod- 
els, such a steep nonlinear L - Mvir relation is 
usually assumed to be set by either suppression 
of gas accretion due to UV heating or by gas 
blowout due to SN feedback, e.g., [|7l). If I in- 
stead assume the faint-end slope of^« 1.5-1.6 
as suggested by recent measurement of [I116II , 
the relation is shallower but is still nonlinear: 

Regardless of the specific processes produc- 
ing the nonlinear luminosity-mass relation, for 
a relation of the form L oc M^j^, we have L oc 
m^j, where y « /3/0.25, using the mo^-M^i^ re- 
lation above taking into account effects of tidal 
stripping. To account for a smaller than a fac- 
tor of four spread in central masses for approxi- 
mately four orders of magnitude spread in lumi- 
nosity one needs 7«6-8or/?ai2-4, the val- 
ues not too different from the estimate above. 
Thus, the weak correlation of the mo.3 <^nd lu- 
minosity will be the natural outcome of any 
CDM-based galaxy formation model which re- 
produces the slope of the faint end of the galaxy 
luminosity function. 

Within the framework I just described, the 
slope of the mo.3 - ^ correlation depends on the 
slope of the L - Mvir,acc correlation [5. The mod- 
els published so far fiM EH HI, as well as the 
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Fig. 10 — The mass within the central 300 pc vs luminos- 
ity for the dwarf satellites of the Milky Way (stars with 
error bars, see 1 83 1). The open symbols of different types 
show the expected relation for subhalos in three different 
Milky Way-sized halos formed in the simulations of the 
concordance ACDM cosmology if the luminosity of the 
subhalos is related to their virial mass at accretion epoch 
as L = 5 X lO-' iQ(Mvir,acc/10' ^o)^'^ (see text for discus- 
sion). 

simple model above with the slope jS ~ 2-3, 
the slope of the L - mo.3 relation is shallow but 
is nevertheless not zero. Constraining this slope 
with future observations will tighten constraints 
on the galaxy formation models and will tell us 
more about the L 
ists. 

To illustrate the points just made. Figure [10 
shows the mo.3 ~ ^ relation for the observed 
nearby dwarfs f83| and subhalo^ found within 
300/7^' kpc around three different MW-sized 
halos formed in the concordance cosmologi- 
cal model (see Il35lll42ll for simulation details). 
To assign luminosity to a given subhalo I fol- 
low the logic of the model presented in [l35l . 
which posits that the brightest observed satel- 
lites should correspond to the subhalos which 



Mvir correlation if such ex- 



The simulations used here do not reliably resolve the 
mass within 300 pc. Therefore, in order to calculate the 
mass 1 have used the mass and concentration of each sub- 
halo at the epoch when it was accreted and computed 
evolution of their density profile given the mass loss they 
experienced by the present epoch, as measured in cosmo- 
logical simulations, and using results of controlled high- 
resolution simulations of tidal evolution [39,1 , which pre- 
dict the evolution of the NFW concentration of halos as 
a function of tidal mass loss. The mass mo.3 was then 
computed from the evolved density profile. 
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have the largest mass before they were accreted. 
In this model the luminosity of stellar systems 
should positively correlate with the mass of its 
host subhalo before it was accreted onto the 
MW progenitor, Mvir^acc^ 



= 5 X 10^ Lg 
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2.5 



(5) 



The power law form of the relation is motivated 
by the approximately power law form of the 
galaxy luminosity and halo mass functions at 
faint luminosities and small masses. The actual 
parameters were chosen such that luminosities 
of the most massive subhalos roughly match the 
luminosities of the most massive satellites, such 
as the SMC and LMcQcL ~ 10^ - 10'' Lq). Af- 
ter all, the first order of business for all the mod- 
els of satellite population is to reproduce the 
abundance and luminosities of the most mas- 
sive (Vmax ^ 40 km/s) satellites. The slope of 
the relation in eq. |5] was set to reproduce the 
range of observed satellite luminosities and flat- 
ness of the mo.3 - L relation. Note that this 
model does not assume any threshold for for- 
mation of galaxies. It simply implies that the ef- 
ficiency with which baryons are converted into 
stars, = A/H./Mvir, steadily decreases with de- 
creasing Mvir at the rate given by eq. [5| 

Figure [10] shows that the model with param- 
eters of eq. |5] is in agreement with observed 
measurements of the mo.3 ~ relation. The re- 
sults will not change drastically if a somewhat 
steeper (« 3 - 3.5) slope is assumed]^ 



The figure does not show the most massive subhalos 
which would correspond to the systems such as the Large 
Magellanic Clouds, which have luminosities L > 10^ Lq 
outside the range shown in the figure. This is justified 
because observational points shown in the figure include 
only the fainter dwarf spheroidal galaxies. 
^ The model of equation [s] is similar to the model IB in 
the recent study by Koposov et al. f791, which assumes 
that stellar mass scales as = /*Mo(Mvir,acc/A^o)'^", 
These authors find that the model reproduces the lu- 
minosity function of satellites for /» « 1.7 X 10""*, 
Mo - 10'" Mq, and a = 2, which gives M« = 1.7 x 
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Fig. 1 1 — The cumulative luminosity function of subha- 
los in the three MW-sized halos shown in Fig. [TO] with 
the simple luminosity assignment ansatz of eq.lsf The 
luminosity function includes all subhalos within 417 kpc 
from the center of each halo, the same radius as was used 
to construct the luminosity function of the observed satel- 
lites shown in Fig.|6] 

Having fixed the parameters of the L - 
^^vir.acc relation, we can then ask the question 
of whether the luminosity function of satellites 
would be reproduced self-consistently by such 
a model. We can use the observed luminos- 
ity functions corrected for completeness for the 
faintest dwarfs from [ )5T| (shown in Fig. |6]) to 
test this. Figure 1 1 shows the subhalo luminos- 
ity functions constructed using subhalos identi- 
fied within 417 kpc (the same outer radius used 
in the construction of the observed luminos- 
ity function by [|5T1l ) in the simulations and the 
simple luminosity assignment scheme of equa- 
tion [5j The luminosity functions in Fig. 1 1 
are in reasonable agreement with observations 



10^(Msat/10^ Mq)^, quite similar to the relation given by 
eq.[5] Koposov et al. adjust parameters to match the ob- 
served luminosity function and then show that L - mo.3 
relation is reproduced, while I adopted the opposite route 
here. The key difference between the models is that 
their model assumes a ceiling on the value of M,/Mvir,acc, 
while I assume no such ceiling. Absence of the ceiling on 
star formation efficiency is actually important for bright 



satellites (see Fig. 14 1. 
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Fig. 12 — The cumulative mo.e function of subhalos in 
the ACDM simulations of three MW halos (dotted lines) 
and for the observed dSph Milky Way satellites (points, 
connected by solid Une) Sagittarius was excluded due to 
its very large mass errors), as measured by fSSj. The 
dashed lines show the mass function for subhalos with 
L > 2.6 X 10^ Lq and d < 270 kpc (the same range of 
luminosities and distances as for the observed satellites) 
with luminosities assigned using equation [5] I have ex- 
cluded the two most luminous objects from the model 
luminous satellites to account for the fact that SMC and 
LMC are not included in the observational sample and 
one additional object to account for non-inclusion of the 
Sagittarius dwarf in the comparison. 



within current uncertainties both in their ampli- 
tude and slope 0.3). 



Figure 12 shows comparison of the mo.e mass 
functions for the observed "classical" dwarf 
spheroidal satellites of the Milky Way [[85l 
and predicted mass function for the entire sub- 
halo population of the MW-sized halos within 
270 kpc (the largest distance of the observed 
satellite included in the comparison) and the 
subhalos with L > 2.6 x 10^ Lq (the smallest 
dSph luminosity included in the observed sam- 
ple) with luminosities assigned using eq. |5| I 
have excluded the Sagittarius dwarf from this 
comparison as its mo.6 mass has very large er- 
rors llSSl . The number of the predicted lumi- 
nous satellites was reduced by three to account 
for exclusion of the Sagittarius, SMC, and 



LMC from the comparison. The figure shows 
that the model predicts the range of mo.e quite 
similar to that measured for the observed lumi- 
nous dSphs. The shape of the mass function 
is also in reasonably good agreement with the 
data. Although there are somewhat more pre- 
dicted satellites at small masses, this is likely 
due to the somewhat larger virial mass of the 
simulated halos (« 2 - 3 x 10^^ Mq) compared 
to the mass of the Milky Way (« 10^^ Mq). We 
expect the number of subhalos to scale approx- 
imately linearly with host mass and so the dif- 
ference in the virial mass of the Milky Way and 
simulated halos can account for the difference 



with observations in Figure 12 There is some 
discrepancy at the largest msixs values, but it 
is not clear just how significant the discrepancy 
is given that the typical errors on the mo.e mea- 
surements for these galaxies are « 20 - 40%. 



Finally, figure 13 compares cumulative radial 
distribution of the observed "classical" Milky 
Way satellites within 280 kpc and satellites with 
similar luminosities and within the same dis- 
tance from their host halo in the model of eq. |5| 
The figure also shows the cumulative distribu- 
tion of all subhalos selected using their cur- 
rent Vmax- The predicted distribution of bright 
luminous satellites is somewhat more radially 
concentrated than the distribution of the Vmax- 
selected subhalos and is in reasonable agree- 
ment with the observed distribution both in its 
median and in the overall shape. 

Thus, the observed mo.s - L relation, the lu- 
minosity function, the mo.e mass function, and 
the radial distribution of the observed satel- 
lites can all be reproduced simultaneously with 
such a simple dwarf galaxy formation scenario. 
The observational uncertainties in these statis- 
tics are still quite large, which leaves significant 
freedom in the parameters of eq. [5] and in its 
functional formj^ It is also possible that all of 



It is quite possible that relation between luminosity and 
mass is more complicated than eq. |5] For example, the 
normahzation of the L - Myk acc relation can evolve with 
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Fig. 13 — The cumulative radial distribution of the ob- 
served "classical" Milky Way satellites (solid points con- 
nected by the solid line) within 280 kpc and satellites 
with similar luminosities and within the same distance 
from their host halo in the model of eq. [5] (dashed lines). 
The figure also shows the cumulative distribution of all 
subhalos selected using their current Vmax (dotted lines). 

these statistics may be reproduced in a drasti- 
cally different scenario. Nevertheless, the suc- 
cess of such simple model is encouraging and 
it is interesting to discuss its potential implica- 
tions. 

First of all, the equation |5] implies that all 
of the observed Milky Way dSph satellites had 
virial masses Myir.acc ^ 5 x 10^ Mq when they 
were accreted and these masses may span the 
range up to ~ 5 x 10'° Mq (the actual range de- 
pends sensitively on the slope of the L - Mvir.acc 
relation). This shows that progenitors of the ob- 
served satellites could have had a wide range 
of virial masses, even though the range of their 
mo.3 and mo g masses is narrow. 

An interesting implication of the value of 
lowest mass of the range of masses above is 

redshift.because luminosity may be determined both by 
the mass of the halo at the accretion epoch and by the pe- 
riod of time before its accretion during which it was suf- 
ficiently massive to withstand star formation suppressing 
processes. Such redshift dependence would be an extra 
parameter which would generate scatter in the L-Mvir,acc 
relation. 



that whatever gas the small-mass halos {M^-„ < 
5 X 10^ Mq) are able to accrete, it should re- 
main largely unused for star formation, and of 
course would not be blown away by supernovae 
(given that the model implies that such objects 
should have no stars or supernovae). If some 
of this gas is neutral, it can contribute to HI 
absorption lines in the spectra of quasars and 
distant galaxies. If this gas is enriched, it can 
also produce absorption lines of heavier ele- 
ments. At lower redshifts, the neutral gas in 
the otherwise starless or very faint halos could 
manifest itself in the form of the High Velocity 
Clouds (HVCs) abundant in the Local Group 
lfT60l[T6Tl[T62l [T63ll and around other galaxies. 

Second, as I noted above the slope of the 
L - Mvir relation required to explain the weak 
dependence of /no.a on luminosity is not surpris- 
ing, given what we know about the faint-end 
slope of galaxy luminosity function and what 
we expect about the slope of the mass function 
of their host halos in CDM scenario [I164II . The 
implied normalization of the L - Myh relation, 
however, is quite interesting. For example, it 
indicates that halo of Myir.acc = 10"' Mq should 
have luminosity of Ly = 1.6 x lO'' Lq. Convert- 
ing it to stellar mass assuming M^/Ly = 1 (ap- 
propriate for old populations, e.g. [I165B ) gives 
Mh. = 1.6 X 10^ Mq. Results of cosmological 
simulations with UV heating of gas show that 
halos of M ~ 10'" Mq should have been able 
to accrete almost all of their universal share of 
baryons, = (^i;,/Q„,)Mvir,acc « 1.7 x 10^ Mq 
(assuming Q.b/0.,„ « 0.17 suggested by the 
WMAP measurements (8l)> even in the pres- 
ence of reahstic UV heating IITM [TISll . The 
derived stellar mass thus implies that only = 
M,/M,i,,acc X (n^/nh) ^ 0.001 (i.e., 0.1%) of 
the expected baryon mass was converted into 
stars in such objects. Such small efficiency 
for systems accretion onto which is not sup- 
pressed by the UV heating implies that star 
formation is dramatically suppressed by some 
other mechanism. 

In fact, the implied efficiency of baryon con- 
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Fig. 14 — The efficiency of gas conversion into stars, de- 
fined as the fraction of baryon mass, expected if the halo 
accreted its universal fraction of baryons, converted into 
stars: Ft - (Qm/Qi)(M»/Mvii-,acc)- The points (as be- 
fore different symbols correspond to subhalos in three 
different simulated host halos) show the dependence of 
F„ on the maximum circular velocity of each subhalo at 
accretion according to luminosity assignment model of 
eq. [5] The short-dashed line shows the functional form 
expected if was controlled by the suppression of gas 
accretion due to UV heating of intergalactic gas (model 
3B of f79j), while long-dashed line shows scaling ex- 
pected in efficiency was set by supernova feedback (e.g., 

unii). 

version into the stars, F^, in this model is a 
steep, power law function of mass and circu- 



lar velocity (F, 
ure 



Wacc)' as shown in fig- 



14 The figure also shows the functional 



form one would have expected if the depen- 
dence of the efficiency on circular velocity 
was determined by the fraction of gas halos are 
able to accrete in the presence of the UV radi- 
ation. This functional form is almost identical 
to the fiducial model 3B of Koposov et al. [1791 
(I used slightly larger value for critical velocity 
because I use V^ax rather than the virial circular 
velocity used by these authors). The two mod- 
els have similar behavior at Vmax.acc ^ 30 km/s, 
but the UV heating model asymptotes to a fixed 
value of = 10"^ for more massive systems. 
This model would therefore underpredict lumi- 
nosities of the most massive satellites in MW- 



sized halos, which was noticed by Koposov et 
al. in its failure to reproduce the bright end of 
the satellite luminosity function. Moreover, the 
estimates of the efficiency for more lumi- 
nous galaxies, such as the Milky Way, are in 
the range F, -0.05-0.2 (HB [Ml M , 
which would lie roughly on the continuation of 
the relation in Fig. [14] to larger circular veloci- 
tiesli] 

These considerations indicate a very interest- 
ing possibility that while the UV heating can 
mediate accretion of gas into very small mass 
halos, the efficiency with which the accreted 
gas is converted into stars in the known lu- 
minous galaxies is dramatically suppressed by 
some other mass-dependent mechanism. This 
strong suppression operates not only for halos 
in which gas accretion is suppressed but for ha- 
los of larger masses as well. Note that this 
suppression mechanism is unlikely to be due 
to blowout of gas by supemovae, which is ex- 
pected to give Ft oc V^^^ (e.g., 1113111 ). a much 



shallower relation than the scaling in Fig. 14 



While discussion of the nature of this sup- 
pression mechanism is outside the scope of this 
paper, the rapidly improving observational data 
on the satellite population should shed light on 
the possible mechanisms. 

The exercise presented in this section illus- 
trates just how powerful the combination of 
luminosity function and radial distribution of 
satellites, high quality resolved kinematics data 
and inferred dynamical constraints on the total 
mass profile, measurements of star formation 
histories and enrichment histories can be in un- 
derstanding formation of dwarf galaxies. The 
rapidly improving constraints on the mass pro- 
files of the dSph galaxies down to the small- 
est luminosities [l83l I73ll should further con- 
strain the range of subhalo masses hosting the 
observed satellites and, by inference, the effi- 



1 



The Ft - Vniax relation could plausibly flatten at larger 



Vm-dx, as expected from the halo modeling of the galaxy 
population (e.g., 1.1 64 j ) 
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ciency of star formation in such small halos. 

The hints that star formation efficiency is ac- 
tually a monotonic function of halo mass from 
galaxies such as the Milky Way to the faintest 
known galaxies, such as Segue 1, indicates that 
the physics learned from the "near-field cos- 
mology" studies of the nearest dwarfs can po- 
tentially give us important insights into forma- 
tion of more massive galaxies as well. 
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